Prediction of human microRNA hairpins using only positive sample learning
Abstract
MicroRNAs (miRNAs) are small non-coding RNA molecules that play important roles in post-transcriptional regulation in animals and plants. They are commonly 21-25 nucleotides (nt) long and are derived from 60-90 nt RNA hairpin structures, called miRNA hairpins. A large number of sequence segments in the human genome have been computationally identified as forming such 60-90 nt hairpins; however, the majority of them are not miRNA hairpins. Most existing computational methods for predicting miRNA hairpins rely on a two-class classifier to distinguish miRNA hairpins from other sequence segments with hairpin structures. The difficulty with these methods is how to select hairpins as negative examples for the training dataset, since only a few miRNA hairpins are known. These classifiers may therefore be mis-trained because some of the negative examples in the training dataset are false negatives. In this paper, we introduce a one-class support vector machine (SVM) method to predict miRNA hairpins among the hairpin structures. Unlike existing methods for predicting miRNA hairpins, the one-class SVM classifier is trained only on information from the miRNA class. We also illustrate some examples of predicting miRNA hairpins in human chromosomes 10, 15, and 21, where our method overcomes the above disadvantages of existing two-class methods.
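The idea of learning from positive examples only can be illustrated with a minimal sketch. This is not the one-class SVM described in the abstract: it swaps in a much simpler stand-in, a centroid-plus-distance-threshold rule, purely to show that a decision boundary can be fitted without any negative examples. The feature vectors, the function names `fit_one_class` and `predict`, and the `quantile` parameter are all hypothetical, and the features are assumed to be already scaled to comparable ranges.

```python
import math

def fit_one_class(positives, quantile=0.9):
    """Learn a centroid and a distance threshold from positive examples only."""
    dim = len(positives[0])
    centroid = [sum(p[i] for p in positives) / len(positives) for i in range(dim)]
    dists = sorted(math.dist(p, centroid) for p in positives)
    # Pick the radius at the requested quantile of the training distances.
    idx = min(len(dists) - 1, int(quantile * len(dists)))
    return centroid, dists[idx]

def predict(model, x):
    """Return +1 if x lies inside the learned region, otherwise -1."""
    centroid, radius = model
    return 1 if math.dist(x, centroid) <= radius else -1

# Made-up, pre-scaled feature vectors standing in for known miRNA hairpins
# (assumption: real features would come from sequence and structure).
known_hairpins = [
    (0.0, 0.1), (0.2, -0.1), (-0.1, 0.0), (0.1, 0.2), (-0.2, -0.1),
    (0.0, -0.2), (0.3, 0.1), (-0.1, 0.2), (0.1, -0.1), (-0.3, 0.0),
]

model = fit_one_class(known_hairpins)

near = predict(model, (0.05, 0.0))  # candidate resembling the positives
far = predict(model, (2.0, 2.0))    # candidate far from every positive
print(near, far)
```

A one-class SVM follows the same train-on-positives pattern but learns a more flexible boundary in a kernel feature space rather than a single sphere around the mean.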
Related resources
Prediction of Conserved Precursors of miRNAs and Their Mature Forms by Integrating Position-Specific Structural Features
MicroRNA (miRNA) precursor hairpins have a unique secondary structure, nucleotide length, and nucleotide content that are in most cases evolutionarily conserved. The aim of this study was to utilize position-specific features of miRNA hairpins to improve their identification. To this end, we defined the evolutionary and structurally conserved features in each position of miRNA hairpins with heu...
MicroRNA Prediction Using a Fixed-Order Markov Model Based on the Secondary Structure Pattern
Predicting miRNAs is an arduous task, due to the diversity of the precursors and complexity of enzyme processes. Although several prediction approaches have reached impressive performances, few of them could achieve a full-function recognition of mature miRNA directly from the candidate hairpins across species. Therefore, researchers continue to seek a more powerful model close to biological re...
Evaluation of Extracellular Circulating Human MicroRNA-197 as a Target Biomarker in Patients with Coronary Artery Disease
Background: Coronary Artery Disease (CAD) refers to the reduction or blockage of all or part of the coronary arteries due to the process of atherosclerosis or the presence of a clot. The aim of this study was to investigate the association of serum miR-197 as a diagnostic index in patients with coronary artery disease. Methods: In this study, 100 patients with CAD were selected. Extraction of...
Vir-Mir db: prediction of viral microRNA candidate hairpins
MicroRNAs have been found in various organisms and play essential roles in gene expression regulation of many critical cellular processes. Large-scale computational prediction of miRNAs has been conducted for many organisms using known genomic sequences; however, there has been no such effort for the thousands of known viral genomes. Some viruses utilize existing host cellular pathways for thei...
A novel over-sampling method and its application to miRNA prediction
MicroRNAs (miRNAs) are short (~22 nt) non-coding RNAs that play an indispensable role in gene regulation of many biological processes. Most current computational, comparative, and non-comparative methods classify human precursor microRNA (pre-miRNA) hairpins against both genomic pseudo hairpins and other non-coding RNAs (ncRNAs). Although there were a few approaches achieving promising ...